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Abstract. String languages recognizable in (deterministic) log-space are characterized 
either by two-way (deterministic) multi-head automata, or following Immerman, by first- 
order logic with (deterministic) transitive closure. Here we elaborate this result, and match 
the number of heads to the arity of the transitive closure. More precisely, first-order logic 
with fc-ary deterministic transitive closure has the same power as deterministic automata 
walking on their input with k heads, additionally using a finite set of nested pebbles. This 
result is valid for strings, ordered trees, and in general for families of graphs having a fixed 
automaton that can be used to traverse the nodes of each of the graphs in the family. Other 
examples of such families are grids, toruses, and rectangular mazes. For nondeterministic 
automata, the logic is restricted to positive occurrences of transitive closure. 

The special case of k — 1 for trees, shows that single-head deterministic tree-walking 
automata with nested pebbles are characterized by first-order logic with unary determin- 
istic transitive closure. This refines our earlier result that placed these automata between 
first-order and monadic second-order logic on trees. 



The complexity class DSPACE(logn) of string languages accepted in logarithmic space 
by deterministic Turing machines, has two well-known distinct characterizations. The first 
one, see e.g., [30J (or Corollary 3.5 of [32]), is in terms of deterministic two-way automata 
with several heads working on the input tape (and no additional storage). Second, Im- 
merman [33j showed that these languages can be specified using first-order logic with an 
additional deterministic transitive closure operator - it is one of the main results in the 
field of descriptive complexity |16[ l35j. Similar characterizations of NSPACE(logn) hold for 
their nondeterministic counterparts |33[ [Ml [61] . 

So we have a match between two distinct ways of specifying languages (two-way multi- 
head automata and first-order logic with transitive closure) that both have a natural param- 
eter indicating the relative complexity of the mechanism used. For multi-head automata 
the parameter is the number of heads used to scan the input; indeed, two heads are more 
powerful than a single one, as two heads can be used to accept nonregular languages like 
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{a n b n | n € N}, whereas single- head two-way automata only accept regular languages 
[53\ 158]. In fact, it is known that in general k + 1 heads are better than k even for a single- 
letter input alphabet [44j . For transitive closure logics, the parameter is the arity of the 
transitive closure operators used: if 4>(x,y) denotes a formula that relates two sequences 
x,y of k variables each, then <f>*(x,y~) denotes the transitive (and reflexive) closure of (f> 
- we will call this k-aiy transitive closure, and it is said to be deterministic if <j> deter- 
mines y as a function of x. Clearly, binary transitive closure is more powerful than unary: 
{a n b n | n £ N} can easily be described in first-order logic with binary deterministic transi- 
tive closure, but first-order logic with unary transitive closure defines the regular languages 
[2l I16|, [52] . It seems to be open whether (k + l)-ary transitive closure is more powerful than 
A;-ary transitive closure, see [29^. 

In [2], Bargury and Makowsky set out to characterize the formulas that capture the 
power of automata with k heads, and they found a class of formulas, called A;-regular, 
with this property. Apart from first-order concepts, the formulas only use k-ary transitive 
closure. They show how fe-head automata can be described by fe-ary transitive closure (both 
deterministically and nondeterministically) but for the converse the k-ary regular formulas 
only work in the nondeterministic case: "the modification of the A:-regular formulas needed 
to take out the nondeterminism will spoil their elegant form, and we do not pursue this 
further" [2]. 

Here we set out from the other side. Starting with the full set of deterministic k- 
ary transitive closure formulas we want to obtain an equivalent notion of deterministic 
automata. Indeed, we succeed in doing this, i.e., we have an automata-theoretic character- 
ization of first-order logic with deterministic fc-ary transitive closure, but we pay a price. 
The two-way automaton model we obtain has k heads, as expected, but is augmented with 
the possibility to put an arbitrary finite number of pebbles on its input tape, to mark posi- 
tions for further use. If these pebbles can be used at will it is folklore [541 [51~] that we obtain 
again DSPACE(logn), a family too large for our purpose. Instead we only allow pebbles 
that are used in a LIFO (or nested) fashion: all pebbles can be 'seen' by the automaton 
as usual, but only the last one dropped can be picked up [26l EH [43j ESI E21 [49] . On the 
other hand our pebbles are more flexible than the usual ones: they can be 'retrieved from 
a distance', i.e., a pebble can be picked up even when no head is scanning the position of 
the pebble (cf. the "abstract markers" of [1]). 

In the nondeterministic case we have to restrict ourselves to formulas with positive 
occurrences of the k-ary transitive closure operator, as we do not know whether the class 
of languages accepted by nondeterministic A:-head two-way automata with nested pebbles 
is closed under complement. 

In fact, our equivalence result (Theorem 15 .3[) is stated and proved for ranked trees in 
general, of which strings are a special case. Our automaton model (a tree-walking automa- 
ton) visits the nodes of an input tree, moving up and down along the edges of the tree. The 
moves (and state changes) are determined by the state of the automaton and the label of 
(and pebbles on) the nodes it visits; additionally we assume that the children of each node 
are consecutively numbered and that the automaton can distinguish this number. 

In Section [5] we translate logical formulas into automata, following [19] and additionally 
using the technique of Sipser [59] to deterministically search a computation space. Section [5] 
considers the reverse: translating automata into logical formulas. As in [2] we adapt Kleene's 
construction to obtain regular formulas from automata, thus getting rid of the states of the 
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automaton (Lemma 15. If) . but we need to iterate that construction: once for each nested 
pebble. 

In Section [6] we summarize the results of our paper for single- head automata on trees, 
the context of our previous papers [19j [21] dealing with tree-walking automata and the 
quest of obtaining simple tree automaton models that are inherently sequential (unlike the 
classic tree automata) and still capture the full power of regular tree languages. 

Finally, in Section [7] we discuss how to extend our results to more general graph-like 
structures, such as unranked trees (important for XML [4_3j [65j H3 E3 EZ] ) , cycles, grids (as 
in [2]; important for picture recognition [U [56J, [25J, HU HQ]), toruses, and, for k > 2, mazes 
[T2"l 13 lj . To have a meaningful notion of graph- walking automaton we only consider 
graphs with a natural locality condition: a node cannot have two incident edges with the 
same label and the same direction. Note that unranked trees satisfy this condition when 
we view them as graphs with 'first child' and 'next sibling' edges in the usual way. Two- 
dimensional grids satisfy it by distinguishing between horizontal and vertical edges. For 
all such graphs, our result holds in one direction: from automata to logical formulas. The 
other direction holds for all families of such graphs for which there exists a single-head 
deterministic graph-walking automaton (with nested pebbles) that can traverse each graph 
of the family, visiting each node at least once. This includes all families mentioned above 
(except mazes, for which the automaton has two heads), and also, trivially, the family of 
'ordered' graphs, i.e., all graphs in which the successor relation of a total order is determined 
by edges with a specific label. Note that the existence of such an automaton is needed to 
implement even simple logical formulas such as 'all nodes have label o~\ 

The main result of this paper, but only for trees and k = 1 (cf. Section [6]), was first 
presented at a workshop in Dresden in March 1999, but unfortunately did not make it 
into the proceedings [66J. Muscholl, Samuelides, and Segoufin [35] have 'reconstructed' 
our missing result, independently obtaining the closure under complementation of the tree 
languages accepted by deterministic tree-walking automata with nested pebbles, taking 
special care to minimize the number of pebbles needed. The results of this paper were then 
presented at STACS 2006 [20]. 

2. Preliminaries 

Trees. A ranked alphabet is a finite set £ together with a mapping rank : £ — > N. 
Terms over E are recursively defined: if a € £ is of rank n, and t\,...,t n are terms, 
then a(ti, . . . , t n ) is a term. In particular a is a term for each symbol a of rank 0. 

As usual, terms are visualized as trees, which are special labelled graphs; cr(ti, . . . ,t n ) 
as a tree which has a root labelled by a and outgoing edges labelled by 1, ... ,n leading 
to the roots of trees for t%, . . . , t n . The roots of subtrees t±, . . . , t n are said to have child 
number 1, . . . , n, respectively; by default the child number of the root of the full tree equals 
0. The set of all trees (terms) over ranked alphabet £ is denoted by Tg. 

Strings over an alphabet £ can be seen as a special case: they form 'monadic' trees 
over £ U {J-}, where rank(cx) = 1 for each a E £, and rank(X) = 0. 

Tree- walking automata. For k > 1, a fc-head tree-walking automaton is a finite-state 
automaton equipped with k heads that walks on an input tree (over a given ranked alphabet 
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E) by moving its heads along the edges from node to nodd3. At each moment it determines 
its next step based on its present state, and the label and child number of the nodes visited. 
Accordingly, it changes state and, for each of its heads, it stays at the node, or moves either 
up to the parent of the node, or down to a specified child. If the automaton has no next 
step, we say it halts. 

The language L(A) Q Ts accepted by the fc-head tree- walking automaton A is the set 
of all trees over E on which A has a computation starting with all its heads at the root of 
the tree in the initial state and halting in an accepting state, again with all heads at the 
root of the tree. The family of languages accepted by fe-head deterministic tree-walking 
automata is denoted by DW fe A, for nondeterministic automata we write NW fc A. 

Such an automaton is able to make a systematic search of the tree (which can be tuned 
to be, e.g., a preorder traversal), even using a single head, as follows. When a node is 
reached for the first time (entering it from above) the automaton continues in the direction 
of the first child; when a leaf is reached, the automaton goes up again. If a node is reached 
from below, from a child, it goes down again, to the next child, if that exists; otherwise the 
automaton continues to the parent of the node. The search ends when the root is entered 
from its last child. This traversal is often used in constructions in this paper. 

In both [H] and [50] . as an example, the authors explicitly construct a deterministic 1- 
head tree- walking automaton that evaluates boolean trees, i.e., terms with binary operators 
'and' and 'or' and constants and 1. 

Again, strings form a special case. Tree-walking automata on monadic trees are equiv- 
alent to the usual two-way automata on strings. A tree- walking automaton is able to 
recognize the root of a tree as well as its leaves (using child number and rank of the sym- 
bols). This corresponds to a two-way automaton moving on a tape, where the input string 
is written with two endmarkers so the automaton knows the beginning and end of its input. 

Logic for Trees. For an overview of the theory of first-order and monadic second-order 
logic on both finite and infinite strings and trees in relation to formal language theory, see 
[63]. 

In this paper our primary interest is in first-order logic, describing properties of trees. 
The logic has node variables x, y, . . . , which for a given tree range over its nodes. There are 
four types of atomic formulas over E: lab a (x), for every a E E, meaning that x has label 
a; edgj(x, y), for every i at most the rank of a symbol in E, meaning that the i-th child of 
x is y; x < y, meaning that x is an ancestor of y; and x = y, with obvious meaning. The 
formulas are built from the atomic formulas using the connectives A, and V, as usual; 
variables can be quantified with 3 and V. 

If t is a tree over E, (j) is a formula over E such that its free variables 
and Ui, . . . , u n are nodes of t, then we write t \= <p(u\, . . . , u n ) if formula <j) holds for t where 
the free valuated as u,-. 

For fixed k > 1, by overlined symbols like x we denote /c-tuples of objects of the type 
referred to by x, like logical variables, nodes in a tree, or pebbles used by an automaton. 
By x[i] we then denote the i-th component of x. 

We consider the additional operator of A:-ary transitive closure. Let (ft(x, y) be a formula 
where x, y are /c-tuples of distinct variables occurring free in <f>. We use <f>* (x, y) to denote 
the transitive closure of <f> with respect to x, y. Informally, <fi* (x, y) means that we can make 
a series of jumps from /c-tuple x to /c-tuple y such that each pair of consecutive fc-tuples 
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x',y' connected by a jump satisfies (f)(x',y'). More formally, let (ft have 2k + m free variables 
x,y, zi, . . . , z m . For tree t and nodes u, v, wi, . . . , w m of t we have t \= (f>*(u, v, wi, . . . , w m ) if 
there exists a sequence of /c-tuples of nodes uo,^i, • • • ,u n , n > 0, such that u = uo, v = u n , 
and t \= (j)(ui,Ui + i,wi, . . . , w m ) for each < i < n. In particular, t \= <p*(u,u, w±, . . . , w m ) 
for every /c-tuple u of nodes of t. 

Formally we should specify the fc-tuples x, y, or rather x' ,y' , of free variables with 
respect to which to take the transitive closure, like for the usual universal and existential 
quantification, and write (tc(x', y')<p(x', y'))(x, y) instead of (p*(x,y). 

A predicate (fr(x, y) with free variables x, y is functional (in x, y) if for every tree t and 
&-tuple of nodes u there is at most one /c-tuple v such that t \= v). If 4> has more 
free variables than x, y, this should hold for each fixed valuation of those variables. The 
transitive closure (f)*(x,y) is deterministic if 4> is functional (in the variables with respect 
to which the transitive closure is taken). Instead of requiring <p to be functional we could, 
equivalently, require it to be of the form ip(x,y) A Vz(ip(x, z) — > y = z). This has the 
advantage of being a decidable property, but it is less convenient in proofs. 

The tree language defined by a closed formula <\> over X consists of all trees t in Ts 
such that t \= (p. The family of all tree languages that are first-order definable is denoted 
by FO; if one additionally allows /c-ary transitive closure or deterministic transitive closure 
we have the families FO+TC fc and FO+DTC fc , respectively. General transitive closure and 
deterministic transitive closure (i.e., over unbounded values of A;) characterize the com- 
plexity classes NSPACE(logn) and DSPACE(logn) (for strings, or more generally, ordered 
structures), respectively, see [TBI [35] . 

By LFO we denote the family of tree languages definable in local first-order logic, i.e., 
dropping the atomic formula x < y. One should note however, that x < y is the transitive 
closure of the functional predicate c x parent of y\ i.e., \J i edg i (x,y). Hence x < y is 
expressible in LFO+DTC 1 , and the families FO+DTC fc , etc., of tree languages definable in 
first-order logic with transitive closure, do not change by this restriction. 

In Section [6] we study the specific case k = 1, i.e., we consider unary transitive closure 
only. The family FO+TC 1 is included in the family of regular tree languages, i.e., the tree 
languages that are definable in monadic second-order logic, which additionally has node set 
variables X,Y, . . . , ranging over sets of nodes of the tree; it allows quantification over these 
variables, and has the predicate x € X, with its obvious meaning. 

Example 2.1. It is proved in [8] that there is a regular tree language T that cannot be 
accepted by any single-head nondeterministic tree-walking automaton. Here we illustrate 
how to construct an FO+DTC 1 formula for that language. 

Let X = {a, b, c}, where a and b are miliary (labelling, of course, the leaves of the trees 
over S) and c is binary (labelling the internal nodes). The language T consists of all trees 
over X for which the path to each leaf labelled by a contains an even number of 'branching' 
nodes, i.e., internal nodes for which both the left and right subtree contain an a-labelled 
leaf. 

It is easy to construct a first-order formula expressing that a node is branching. Let 
ip(x, y) specify that y is the lowest branching ancestor of x, and let 4>(x, y) = (3z)(tp(x, z) A 
ip{z,y)) to claim y is the second lowest branching ancestor of x. Observe that 4> is func- 
tional. Now T is specified by the FO+DTC 1 formula (Vx)(lab a (x) -> (3y)[ 4>*{x,y) A 
^(3z)^y,z)]). 

In fact T even belongs to FO, as observed in [8], but earlier in |52(, Lemma 5.1.8]. □ 
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3. Tree- Walking Automata with Nested Pebbles 

A £;-head tree-walking automaton with nested pebbles is a /c-head tree-walking automa- 
ton that is additionally equipped with a finite set of pebbles. During the computation it 
may drop these pebbles (one by one) on nodes visited by its heads, to mark specific posi- 
tions. It may test the currently visited nodes to see which pebbles are present. Moreover, 
it may retrieve a pebble from anywhere in the tree, provided the life times of the pebbles 
are nested. This can be formalized by keeping a (bounded) stack in the configuration of 
the automaton, pushing and popping pebbles when they are dropped and retrieved, respec- 
tively. Note that since this stack is bounded by the number of pebbles, it can also be kept 
in the finite control of the automaton!! Pebbles can be reused any number of times (but 
there is only one copy of each pebble). Accepting computations should start and end with 
all heads at the root without pebbles on the input tree. 

The family of tree languages accepted by deterministic fe-head tree-walking automata 
with nested pebbles is denoted by DPW fe A, the nondeterministic variant by NPW fc A. 

Some specific properties of these pebbles must be stressed. First, as stated above, 
pebbles are used in a LIFO manner, as in [26\ \19\ H31 [65j [22], I49j. which means that only the 
last one dropped can be retrieved, and thus their life times on the tree are nested. Without 
this restriction again the classes DSPACE(logre) and NSPACE(logn) would be obtained. 
Second, this is rather nonstandard, the automaton need not return one of its heads to the 
position where a pebble was dropped in order to pick it up: at any moment the last pebble 
dropped can be retrieved. This means that the pebble behaves as a pointer: we can store 
the address of a node when we know it (which is the case when we visit it) and we can 
later wipe the address from memory without the need to return to the node itself. Such 
pebbles were called "abstract markers" in [3] (to distinguish them from the usual "physical 
markers"). Finally, as opposed to [26], during the computation all pebbles dropped remain 
visible to the automaton (and not only the one or two on top of the stack). 

Example 3.1. The regular tree language T from Example 12.11 cannot be accepted by any 
single- head nondeterministic tree- walking automaton (without pebbles), as proved in [8]. 
As an example, here we show how to accept T by a (single-head) deterministic tree-walking 
automaton with two nested pebbles. 

Using a preorder traversal of the input tree, the first pebble is placed consecutively on 
leaves labelled by a. For each such position, starting at the leaf we follow the path upwards 
to the root counting the number of branching nodes. To test whether an internal node is 
branching we place the second pebble on the node and test whether its other subtree, i.e., 
the subtree that does not contain the first pebble, contains an a-labelled leaf (using again a 
preorder traversal of that subtree, the root of which can be recognized through the second 
pebble which marks the parent of that root). After testing the node, we pick up the second 
pebble. At the root, we reject whenever we count an odd number of such nodes on a path; 
otherwise we return to the position of the first pebble (using another preorder traversal of 
the tree). 

If the automaton uses pebbles xi, ...,x n , then the contents of the stack can be any string over 
{x\, . . . ,x n } in which each x% occurs at most once. It can be assumed w.l.o.g. that the stack always 
contains X\X^ ■ ■ ■ Xi for some i, but we will do this only in the proof of Lemma |5 . 2 1 ("where . in fact, the order 
is reversed). 
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In fact, the language can be accepted with just one pebbl^l (which is nested trivially). 
As explained above, the pebble can be used to detect all branching nodes, which, together 
with all a-labelled leaves, can be viewed as a binary tree. To check, for that tree, that the 
path to each leaf is of even length, the automaton performs a preorder traversal and counts 
its number of steps, modulo 2. At each leaf the count should be 0. □ 

To fix the model, a fc-head tree-walking pebble automaton is specified as a tuple A = 
(Q,Ti,X,qo,A,I), where Q is a finite set of states, £ is a (ranked) input alphabet, X a 
finite set of pebbles, qo 6 Q the initial state, A C Q the set of accepting states, and I the 
finite set of instructions. 

Each instruction is a triple of the form (p, ip, q) or (p, (p, q) or (p, ~tp, q), where p,q G Q 
are states, ip is an operation, and (p a test. There are four types of operations: upj, downy 
(moves), dropj(x), and retrieve(x) (pebble operations), and three types of tests: labj i(7 , 
pebj(aj), and chnoy, where in each case i indicates a head (1 < i < k), j is a child number 
(1 < j < max{rank(<r) | a G £}), o G £ is a node label, and x G X is a pebble. An 
instruction (p, x, q) is called an outgoing instruction of state p. 

The automaton A is deterministic if for any pair (p,XiiQi)-> (PiX2,Q2) of distinct in- 
structions starting in the same state, either xi = ~X2 or X2 = 

A configuration of A on tree t over £ is a triple [p, u, a], where p G Q is a state, H is a 
/c-tuple of nodes of t indicating the positions of the k heads, and a = (xi,wi) ■ ■ ■ (x m , w m ) 
the stack of pebbles dropped at their positions (m > 0, Xj G X, Wj a node of t). The initial 
configuration equals [qo, root, e], where root consists of k copies of the root of t, and e is the 
empty stack. 

The semantics of the pebble automaton is defined using the step relation on 
configurations for automaton A on input tree t. We have [p, u, a] \~A,t [q,v,(3] with 
a = (x\, w±) ■ ■ ■ (x m , w m ), if there exists an instruction (p, x, q) such that 

if then 

X = upj v[i] is the parent of u[i], v[h] = u[h] for h ^ i, a = (3 

downy v[i] is the j-th child of u[i], v[h] = u[h] for h ^ i, a = /3 

dropj(x) P = a(x, u[i]),x ^ {x±, . . . , x m },u = v 

retrieve(x) m > 1, P(x m ,w m ) = a, x = x m ,u = v 

labj i(7 u[i] has label cr, u = v, a = p 

pebj(x) (x, u[i}) occurs in a, u = v, a = P 

chnojj the child number of u[i] is j, u = v, a = P 

or in case of the negative tests, x = ~ labj jCr , ~ pebj(x), ~ chnojj, the tests above are 
negated, whereas head positions and pebble stack remain unchanged (u = v, a = P). 

A configuration c is halting if there is no d such that c d, and it is accepting if it 
is halting and c = [p, root,e] for some p G A. Then the language accepted by A is defined 
as L(A) = { t G Ts | [gojroot,e] \~\ t c for some accepting configuration c }. 

Example 3.2. Let £ be as in Example 12.11 We write, in our formalism, a deterministic 
single-head tree- walking automaton A (without pebbles) such that L(A) consists of all trees 
over £ that have a-labelled leaves only. The automaton performs a preorder traversal of 
the input tree. The main states are as follows. In state 1 we move down to the left child 
until we reach a leaf, in state 2 we are at a left child and move to its right sibling, and in 
state 3 we move up until we are at a left child or at the root. 



3 As brought to our attention by Christof Loding (and his student Gregor Hink). 
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As the automaton has only a single head, we omit the head number in the instructions. 
The initial state is 1, the only accepting state is h. The automaton has the following 
instructions: 

(1, lab c ,l'), (l'.dowm.l), (1, -label"), (l",lab a ,3), 
(3,chno 2 ,3'), (3', up, 3), (3, ~ chno 2 , 3"), 
(3",chnoi,2), (3",~chnoi,/i), 

(2, up, 2'), (2',down 2) l). □ 



4. From Logic to Nested Pebbles 

We now generalize the inclusion FO C DPW 1 A from Section 5 of [19| , on the one side 
introducing k-aiy transitive closure, on the other side allowing k heads. Note also the result 
is for the 'pointer' variant of pebbles, rather than pebbles that have to be picked up where 
they were dropped. 

Lemma 4.1. For trees over a ranked alphabet, andk>\, FO+DTC fc C DPW fe A. 

Proof. The proof is by induction on the structure of the formula. For each first-order formula 
with deterministic transitive closure we construct a deterministic tree-walking automaton 
with nested pebbles that always halts (with all its heads at the root). The additional effort 
we have to take to make it always halting, will pay itself back when we deal with negation, 
but is also helpful when considering disjunction and existential quantification. Generally 
speaking, each variable of the formula acts as a pebble for the automaton. In case of fc-ary 
transitive closure we need 3k pebbles to test the formula by an automaton. Most features 
can be simulated using a single head, moving pebbles around the tree, only for transitive 
closure we need all the k heads. 

As intermediate formulas may have free variables we need to extend our notion of 
recognizing a tree by an automaton: a valuation of the free variables is fixed by putting 
pebbles on the tree, one for each variable, and the automaton should evaluate the formula 
according to this valuation. 

More formally, let (j> = 4>(x\, . . . , x n ) be a formula with free variables x±, . . . , x n . The 
automaton A for 4> should check whether t \= <f>{u\, . . . , u n ) for nodes u±, . . . ,u n in a tree 
t, as follows. It is started in the initial state with all heads at the root of the tree t, 
where u%, . . . , u n are marked with pebbles xi,...,x n . During the computation A may use 
additional pebbles (in a nested fashion) and it may test xi, . . . , but it is not allowed 
to retrieve them. The computation should halt again with all heads at the root of t with 
the original configuration of pebbles. The halting state is accepting if and only if t \= 
(f)(ui, . . . ,u n ). 

For the atomic formulas it is straightforward to construct (single-head) automata. As 
an example, for <j> = x < y the automaton searches the tree for a marked node representing 
y. From that position the automaton walks upwards to the root, where it halts, signalling 
whether x was found on the path from y to the root. For edgj(x, y) the automaton searches 
for x, then determines whether x has an i-th child (the arity of the node can be seen from 
its label) and moves to that child. There the automaton checks whether pebble y is present. 

For the negation eft = -\<f>i of a formula we use the original automaton for <f>i, but change 
its accepting states to the complementary set. This construction works thanks to the fact 
that the automata we build are always halting. 
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A similar argument works for the conjunction <p = 0i A</>2, or the disjunction <p = 4>i\/(f>2, 
of two formulas. We may run the two automata constructed for the two constituents 
consecutively. Note that the free variables in (pi and <p2 need not be the same, but the extra 
pebbles that need to be present for <j) are ignored by each of the two automata. 

For quantification (p = (Vx)(/>i the automaton makes a systematic traversal through the 
tree, using a single head. When it reaches a node (for the first time) it places a pebble x at 
that position. Then it returns to the root, and runs the automaton for (pi as a subroutine; 
the free variable x of (pi is marked by the pebble, as requested by the inductive hypothesis. 
When this test for (pi(x) is positive, i.e., the subroutine halts at the root in an accepting 
state, the automaton returns to the node marked x (by searching for it in the tree), picks 
up the pebble, and places it on the next node of the traversal. When the automaton has 
successfully run the test for (pi for each node it accepts. The formula (3x)(pi is treated 
similarly. 

The main new element of this proof compared to [19J is the introduction of k-avy 
transitive closure. Here we need to program a walk from one fc-tuple of nodes to another k- 
tuple with 'jumps' specified by a 2/c-ary formula. We cannot do this in a straightforward way, 
as we might end 'jumping around' in a cycle without noticing. Such an infinite computation 
violates the requirement that our automaton should always halt. We use a variant of the 
technique of Sipser [59] to avoid this trap, and run this walk backwards. 

So, let <p = (p\ be the transitive closure of a functional 2A;-ary predicate (pi. Given a 
tree with 2k nodes marked by pebbles x and y we have to construct an automaton A that 
decides whether we can connect the /c-tuples x and y by a series of intermediate /c-tuples 
such that (pi holds for each consecutive pair. In what follows we assume that (pi has 2k free 
variables x and y (with respect to which the transitive closure is taken), and we disregard 
the remaining free variables of (pi (the values of which are fixed by pebbles) . 

Now consider the set of /c-tuples of nodes of the input tree t, spanning the (virtual) 
computation space of A. We build a directed graph on these /c-tuples by connecting vertesjj 
u to vertex v if t \= (pi(u,v), i.e., the pair (u,v) in t satisfies (pi(x,y). As (pi is functional, 
for each vertex there is at most one outgoing arc. Thus, if we fix a vertex v (of k nodes in t) 
and throw away the outgoing arc of v (if it exists) the component of vertices connected to v 
in this graph forms a tree tk(v), with arcs pointing towards the root v rather than towards 
the leaves. Note this is a directed tree in the graph-theoretical sense; there is no bound on 
the number of arcs incident to each vertex. 

Of course, this tree t^ilj) with 0i-arcs consists of all vertices u that satisfy t \= (p(u,v), 
and fixing v to be the vertex marked by the pebbles y the new automaton A traverses that 
tree tk(v) and tries to find the vertex marked by pebbles x. 

However, the tree tk(v) is not explicitly available, and has to be reconstructed while 
walking on the input tree t, using the automaton A\ for (pi as a subroutine. In particular, 
we want to implement a traversal on tk(v). As the vertices of tk(v) consist of /c-tuples of 
nodes of t, we order these fc-tuples in a natural way using the lexicographical ordering based 
on the preorder in t. In this way we impose an ordering on the children of each vertex of 
tk(v), thus allowing the usual preorder traversal of tk(v) as described below. To find the 
successor of a fc-tuple z in the lexicographical ordering we act like adding one to a /c-ary 
number: change the last coordinate of the tuple z into its successor (here the preorder 



For clarity we distinguish 'node' in the input tree from 'vertex' in the computation space, i.e., a fc-tuple 
of nodes. Similarly we use 'edge' and 'arc'. 
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successor in t) if that exists, otherwise reset that coordinate to the first element (here the 
root of t), and consider the last-but-one coordinate, etc. 

We implement a preorder traversal of t^iv), which means we compute the preorder 
successor of each vertex of the tree tk(v) whose arcs are defined by <f>\, and where the 
ordering between sibling vertices is based on the lexicographical ordering of nodes in t: 

preorder successor of vertex u in tk(v): 
if it exists, the first child of u, 
else, on the path of u to the root v, 

the right sibling of the first vertex that has one. 

We traverse the tree tk(v), with 2k pebbles x and y fixed, with the help of 3k additional 
pebbles x', y', and z'. During this traversal, A keeps track of the current vertex of tk(v) with 
its k heads. Initially the heads move to y, i.e., to v. Note that the order of dropping the 
pebbles x' and y' differs in the two cases below: in the first case we have to check <pi(x',y') 
'backwards', finding x' given y', while in the second case it is the other way around. This 
is reflected in the order of dropping x! and y' . 

First, we describe how to check whether the current vertex has a first child in tk(v), and 
to go there if it exists. We drop pebbles y' to fix the current vertex, and we systematically 
place pebbles x' on each candidate vertex, i.e., each fc-tuple of nodes of the tree t (except 
v). Thus, lexicographically, in each step the last pebble of x' is carried to the next node 
in t (with respect to the preorder in t), but when that pebble has been at all nodes, it is 
lifted, the last-but-one pebble is moved to its successor node in t, and the last pebble is 
replaced on the root, etc. For each fc-tuple x' we check 4>i(x',y') using automaton Ai as a 
subroutine. If the formula is true, we have found the first child in t^iy) and we move the k 
heads to the nodes marked by x', lift pebbles x', and retrieve pebbles y' (from a distance). 
If the formula is not true, we move x' to the next candidate vertex as described above (but 
v is disregarded). If none of the candidates x' satisfies 4>i(x', y'), the vertex y' obviously has 
no child in tk{v). 

Second, we describe how to check for a right sibling in tk(v), and go there if it exists, 
or go up (to the parent of the current vertex) otherwise. The problem here is to keep the 
pebbles in the right order, adhering to the nesting of the pebbles. First drop pebbles x' 
on the current vertex. Then determine its parent in tk(v); this is the unique vertex that 
satisfies 4>i(x',y r ), where y' marks the parent vertex of a;', thanks to the functionality of (j>\. 
It can be found in a traversal of all fc-tuples of nodes of t using pebbles y' and subroutine 
Ai (as described above for the first child). Leave y' on the parent and return to x' (by 
searching for x' in the tree t). Using the third set of k pebbles z', traverse the /c-tuples of 
nodes of t from x' onwards and try to find the next fc-tuple that satisfies 4>i(z',y') when z' 
is dropped. If it is found, it is the right sibling of x' . Return there, lift ~z' , and retrieve y' 
and x! . If no such fc-tuple is found, the current vertex has no right sibling, and we go up in 
the tree tk(v), i.e., we return to y' . Here we lift y' and retrieve x! . 

In all these considerations special care has to be taken of the root v. It has no parent 
in tk(v). Fortunately v is clearly marked by pebbles y. □ 

The number of pebbles needed to compute a formula of FO+DTC fc according to the con- 
struction above depends only on the nesting of quantifiers and transitive closures in the 
formula. For each quantifier we count a single pebble, and 3A; for transitive closure, and 
compute the maximum needed over all sequences of nested operators in the formula. 
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When allowing transitive closure of arbitrary formulas (not requiring them to be func- 
tional) it is customary to restrict attention to formulas with only positive occurrences of 
transitive closure, i.e., within the scope of an even number of negations (see, e.g., [16, 35]). 
Using standard argumentation each such formula is equivalent to one where negation is 
applied to atomic formulas only. 

For such formulas there is a similar, nondeterministic, result as the one above. Atomic 
formulas and their negations are treated as above, and so are conjunction and universal 
quantification. For disjunction and existential quantification, the automaton uses nonde- 
terminism in the obvious way. For transitive closure, the Sipser technique we have used in 
the previous proof is not needed. For a formula (j) = (f>* the automaton A checks nondeter- 
ministically the existence of a path uq , u\ , . . . , u n from vertex x to vertex y in the directed 
graph determined by <p\ (described in the proof of Lemma l4.ip . When A is at vertex Hi, it 
proceeds to vertex Ui+i using 2k additional pebbles x! and y' , as follows. It drops x' on the 
current nodes Uj and nondeterministically chooses nodes u%+i, where it drops y' and checks 
that 4>i(x',y'). Then it returns to y' , lifts y', and retrieves x'. 

We denote the positive restriction of FO+TC fc by FO+posTC fc , and similarly for the 
deterministic case. Thus, for trees over a ranked alphabet, nondeterministic /c-head tree- 
walking automata can compute positive k-aiy transitive closure: FO+posTC fc C NPW fe A. 

5. From Nested Pebbles to Logic 

The classical result of Kleene [38] shows how to transform a finite-state automaton into 
a regular expression, which basically means that we have a way to dispose of the states of 
the automaton. Bargury and Makowsky [2| observe that this technique can also be used to 
transform multi-head automata walking on grids into equivalent formulas with transitive 
closure: transitive closure may very well specify sequences of consecutive positions on the 
input, but has no direct means to store states. A similar technique is used here. As our 
model includes pebbles, this imposes an additional problem, which we solve by iterating the 
construction for each pebble. Unlike [2] we have managed to find a formulation that works 
well for both the nondeterministic and deterministic case. 

Given a (deterministic) computational finite-state device with k heads on the tree, the 
step relation of which is specified by logical formulas, we show that the computation rela- 
tion that iterates consecutive steps can be expressed using k-aiy (deterministic) transitive 
closure. Of course, the consecutive positions of the heads along the tree are well taken care 
of by the closure operator, but here we additionally require that the states of the device 
should match the sequence of steps. 

Let be a Q x Q matrix of predicates 4> Ptq (x,y), p,q £ Q for some finite set Q (of 
states), where x, y each are k distinct variables occurring free in all <j) Pt q- We define the 
computation closure of $ with respect to x, y as the matrix consisting of predicates 
(j)p, q (x,y) where t \= 4>p t q(u,v) iff there exists a sequence of fc-tuples of nodes uq,u±, . . . ,u n 
and a sequence of states po,pi, ■ ■ ■ ,p n , n > 1, such that u = no, v = u n , p = po, q = p n , 
where t \= 4> Pl , Pl+1 (ui,Ui + i) for < i < n. (0) 

^Note that, to simplify the description of computation closure we have disregarded the remaining free 
variables of the P , 9 and More precisely, if zi, . . . , z m are all the free variables of all <j>p,g ( m addition 

to x, y), then each 0* q has free variables x, y, Zi, . . . , z m - In the definition of t \= (p^ q (Ti,v,wi, . . . ,w m ) the 
have fixed values Wi,..., w m . 
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Intuitively t \= 4>p,g(u,v) means that there is a $-path of consecutive steps (as specified 
by matrix $) leading from nodes u in state p to nodes v in state q. Note that only nonempty 
paths are considered (n > 1). 

We say that $ is deterministic if its predicates are both functional and exclusive, i.e., for 
any p, q,q' G Q and 3k nodes u,v,v' of any tree t, if both t \= (f> Piq (u,v) and t (= (f> p>q '(u,v') 
then q = q' and U = u'. Moreover, $ is said to be semi- deterministic if the previous 
requirement holds for final states q, q' only, where q is final if (j>q ir is false for all r € Q (and 
similarly for q'). 

Lemma 5.1. 

(1) //$ is deterministic, then is semi- deterministic. 

(2) //$ is in FO+TC fc ; ffcen so is $ # . 

(3) //$ is in FO+DTC fc and deterministic, then $ # is in FO+DTC fe . 

Proof. 1. Let q,q f £ Q be final, and assume that for tree i both (f)f :q (u,v) and 4> pq /(u,v') 
hold for p Q and 3/c nodes u,v,v' of i, with <&-paths of length n and n' as in the definition 
of computation closure (n, n' > 1). 

Consider the first steps of both paths. We have <fi PtPl (u,ui) and 4> PtP ' (u, u[ ) , as well as 



4 ) px,q{ u li v ) if n > 2, and <^r, if n' > 2. Due to the determinism of $ we conclude 

pi = p' x and ui = u[ . 

If n = 1, then u\ = v and p\ = q. Since p\ = q is final, (fft, q ,(ii?i,v') is false. Hence 

n' = 1 and v' = u\ = v and g' = pi = g as required. For n, n' > 2 we continue inductively 
with pi and «i . 

5. The proof is a logical interpretation of the method of McNaugton and Yamada [42J . 
Without loss of generality we assume that Q = {1,2, .. . , m}. We show by induction on 
i how to construct a matrix $W of formulas <j)p\ in FO+TC fc which are defined as d>v a, 
except that the intermediate states p\, . . . ,p n -i ar e chosen from {1, ...,£}. In particular, 
for £ = no intermediate states are allowed, whereas for £ = m all states are allowed, so we 
have = $ # . 

For £ = 0, the length of the path is one. This means that = 

Given $^ we obtain <j?(^ +1 ) as follows. Assume ^^(x, y) holds. Either there exists 
a <3>-path that does not visit state £ + 1 (i.e., pi / £ + 1 for all < i < n to be precise), or 
this state is visited one or more times during the path. In the former case <ppj q {x, y) holds, 
in the latter case we have a path from state p to state £ + 1, perhaps looping several times 
from £ + 1 back to itself, and finally there is a path from state £ + 1 to state q. Neither of 
these paths contains £ + 1 as intermediate state, so in this case 4>p^ O^j v) postulates the 
existence of intermediate nodes x 1 and y' such that 

5. In the previous part of the proof transitive closure was applied to predicates 4> e +i m- 

U) 

However, determinism of $ entails functionality of predicates of the form (p ri+1 , by an 
argument analogous to the one in 1. above. Note that state £ + 1 need not be final, but the 
paths to state £ + 1 cannot be extended because (by definition of <J>^) state £ + 1 cannot be 
visited intermediately. Hence, each transitive closure is applied to a functional predicate, 
i.e., it is a deterministic transitive closure. □ 
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Lemma 5.2. For trees over a ranked alphabet, and k > 1, DPW fc A C FO+DTC fe . 

Proof. Consider a tree-walking pebble automaton with k heads. We assume that (1) ac- 
cepting states have no outgoing instructions (i.e., if {p,x> a ) is an instruction, then p is 
not accepting), (2) the initial state is not accepting, and (3) if there is an instruction 
(p, dropj(x), q), then there is no instruction (q, retrieve(x), r). The latter two requirements 
are to ensure that accepting computations, and computations between dropping and re- 
trieving a pebble, are nonempty, allowing the use of Lemma 15.11 

Let the automaton use n pebbles, x n , ... ,xi, where pebbles are placed on the tree in 
the order given, i.e., x n is always placed on the bottom of the pebble stack. We view the 
automaton as consisting of n+1 'levels' A n , ■ ■ ■ ,Ai,Aq such that Ag is a A;- head tree- walking 
pebble automaton with £ pebbles x#, ... ,x\, available for dropping and retrieving, whereas 
pebbles x n , . . . ,xi+i have a fixed position on the tree and the automaton Ag may test for 
their presence. Basically, Ag acts as a tree-walking automaton that drops pebble xg, then 
queries pebble automaton Ag-i with £ — 1 pebbles where to go in the tree, moves there, 
and retrieves pebble xg (from a distance). 

We postulate that the number of pebbles dropped is kept in the finite control of the 
automaton, so we can unambiguously partition the state set as Q = Q n U • • • U Q\ U Qo, 
where Qg consists of states where £ pebbles are still available. The set Q n contains both 
initial and accepting states. Automaton Ag equals the restriction of the automaton to the 
states in Qg; we will not specify initial and accepting states for Ag, £ < n. 

We show how to express the computations of automaton Ag, I > 0, on the input tree as 
FO+DTC fc formulas, provided we know how to express computations of automaton Ag-\ 
if £ > 1. For Ag a matrix is constructed with predicates cf)p\ for p, q € Qg. These 
predicates represent the single steps of Ag, so t \= 4>p^{u, v) iff Ag has a nonempty com- 
putation from configuration [p, u, a] to configuration [q,v, a]. Note that $W has additional 
free variables x n , . . . , xg + \ that will hold the positions of the pebbles already placed on the 
tree, thus representing the pebble stack a. 

We first study the steps while the pebble xg has not been dropped. For each of its 
heads, automaton Ag may test the presence of one of the pebbles x n , . . . , xg + \, or the node 
label or the child number of the current node, or it may move the head up to the parent or 
down to a specified child. The semantics of these separate instructions, relations between 
the current and next configurations \p, u, a] and [q, v, a], are easily expressed in first-order 
logic. So, we have the following translation table: 

instruction: formula: 

(P> up,, q) Vj edg i (w[i], «[*]) A A/^i u[h] = v[h] 

(p, downij,q) edgj (u[i], v[i]) A f\ h ^ u[h] = v[h] 

(p,labj j(T ,g) lab <x («[z']) A f\ h u[h] = v[h] 

(P, Pebi(x m ),q) u[i] = x m A /\ h u[h] = v[h] 

(p,chnoij,q) (3u' ) edgj (u' ,u[i\) A /\ h u[h] = v[h] 

or in case of the negative tests ~labj jCr , ~pebj(x), and ~chnojj, the tests above are 
negated, whereas head positions remain unchanged, e.g., for (p, ~ labj i0 ., q) the formula 
is -i lab CT (ti[i]) A f\ h u[h] = v[h}. 

In general 4>^}j(u, v) is a disjunction of such formulas, as we may have parallel instruc- 
tions in the automaton. 
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Additionally when I > 1, Ai may drop pebble xt in state p, simulate Ae-i, and re- 
trieve pebble xt returning to state q. Such a 'macro step' from configuration [p, u, a] to 
[q,v, a] is only possible when there is a pair of pebble instructions (p, drop^ [xi) , p') and 
(</', retrieve (a;^ ), g), such that has a (nonempty) computation from \p',u, a'] to [(/, U, a'], 
with a' = Hence, At can take a 'step' from [p, u, a] to [q,v, a] if the disjunction 

of <fip, J, (tt, u) over all such g' holds, where the free variable in that formula is replaced 
by u[i], the current position of the i-th head of the automaton, i.e., the position at which 
that pebble is dropped. Note that in Ae-i, q' has no outgoing instructions (and hence q' is 
a final state of S^" 1 )*). 

Defining the remaining (f>pj q to be false, we obtain a step matrix $W, which is deter- 
ministic thanks to the determinism of the automaton and the semi-determinism of 
cf. Lemma l5.1f 1). It is in FO+DTC fc by Lemma I5.1f 3). The computational behaviour of 
the automaton At is expressed by <J>W#, in general, and more specifically for A n , by the 
disjunction of all formulas ^p n q*(root, root) with p the initial state and q an accepting state. 
Note that the last formula is correct by assumption (1) in the beginning of this proof. □ 

Combining the two inclusions in Lemma 14.11 and Lemma 15.21 we immediately get the main 
result of this paper. Note that it includes the case of strings. 

Theorem 5.3. For trees over a ranked alphabet, andk>\, DPW fc A = FO+DTC fc . □ 

As a corollary we may transfer two obvious closure properties of FO+DTC fc , closure un- 
der complement and union, to deterministic tree-walking automata with nested pebbles, 
where the result is nontrivial. These properties are a rather direct consequence of the 
always-halting normal form in the proof of Lemma 14.14 which can be obtained for every 
deterministic automaton. For k = 1, this normal form is further studied with regard to the 
number of pebbles needed in [451 [9]. 

Corollary 5.4. Let k > 1. For each deterministic k-head tree-walking automaton with 
nested pebbles we can construct an equivalent one that always halts. □ 

When the tree-walking automaton is not deterministic we no longer can assure the deter- 
minism of the formulas in the proof of Lemma 15.21 However, by Lemma l5.1T 2) they 
are in FO+TC fc . 

Theorem 5.5. For trees over a ranked alphabet, and k>l, NPW fe A C FO+TC fe . □ 

The constructions in the proof of Lemma 15.21 use negation in one place only: it is used on 
atomic predicates, to model negative tests of the automaton (to check there is no specific 
pebble on a node). Note that negation is not used to construct the formulas in the proof 
of Lemma 15.11 Hence we obtain positive formulas, where negation is only used for atomic 
predicates, and thus the inclusions DPW fe A C FO+posDTC fc and NPW fe A C FO+posTC fc . 
In the deterministic case negation of a transitive closure can (for finite structures) be easily 
expressed without the negation [28], thus in a positive way: FO+posDTC fc = FO+DTC fe . 
With that knowledge the first inclusion is not surprising; for the nondeterministic case we 
additionally find a new, positive, characterization (cf. the end of Section H]). 

Corollary 5.6. For trees over a ranked alphabet, andk>\, NPW fe A = FO+posTC fc . □ 
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As observed in the Introduction, we do not know whether NPW fe A is closed under comple- 
ment (i.e., whether 'pos' can be dropped from Corollary I5.6p . Using the method of [341 161] . 
it is easy to see that, for trees, U fceN NPW fc A *s closed under complement, which means it 
is equal to IJfceN FO+TC fc (and note that it also equals UfceN NW fc A). 

6. Single Head on Trees 

More than thirty years ago, single-head tree-walking automata (with output) were 
introduced as a device for syntax-directed translation pQ (see |23|). Quite recently they came 
into fashion again as a model for translation of XML specifications [33j [631 EH EH 1371 HP] . 

The control of a single-head tree-walking automaton is at a single node of the input 
tree. Thus it differs from the more commonly known tree automata. These latter automata 
work either in a top-down or in a bottom-up fashion and are inherently parallel in the sense 
that the control is split or fused for every branching of the tree. 

The power of the classic tree automaton model is well known. It accepts the regular 
tree languages (both top-down or bottom- up), although the deterministic top-down variant 
is less powerful. For tree-walking automata however, the situation was unclear for a long 
time. They accept regular tree languages only [361 [23], but it was conjectured in (and 
later in [211 [19l [TO] ) that tree- walking automata cannot accept all regular tree language^. 
This was first proved for 'one- visit' automata (for the deterministic case in |6l|30], and for 
the nondeterministic case in [48J). Recently the conjecture was proved, in a very elegant 
way, for deterministic tree- walking automata in [7J, and for nondeterministic tree- walking 
automata in [8] (see Examples 12.11 and 13 . 1 p . 

The reason that tree- walking automata cannot fully evaluate trees like bottom-up tree 
automata is that they easily loose their way. When evaluating a subtree it is in general 
hard to know when the evaluation has returned to the root of the subtree. In order to 
facilitate this, in [19] the single-head tree-walking automaton was equipped with pebbles. 
This was motivated by the ability of pebbles to help finite-state automata find their way 
out of mazes [3J. 

In [19] we have shown that all first-order definable tree languages can be accepted 
by single-head (deterministic) tree-walking automata with nested pebbles, and that tree 
languages accepted by single-head (nondeterministic) tree-walking automata with nested 
pebbles are all regular. 

As observed before, DSPACE(logn) is the class of languages accepted by single-head 
two-way automata with (nonnested) pebbles [331 EI]. Thus, for k = 1 (single-head automata 
vs. unary transitive closure), our main characterization for tree languages, Theorem 15 .31 can 
be seen as a 'regular' restriction of the result of Immerman characterizing DSPACE(logn); 
on the one hand only (single-head) automata with nested pebbles are allowed, while on the 
other hand we consider only unary transitive closure, i.e., transitive closure for <f>(x, y) where 
x, y are single variables. Note that unary transitive closure can be simulated in monadic 
second-order logic, which defines the regular tree languages. 

We compare the family of tree languages FO+DTC 1 = DP\N 1 A with several next of kin. 
In the diagram below we have five families of languages xW x A accepted by (single-head) 
tree-walking automata, which are either deterministic, nondeterministic, or alternating (D, 



'Although a footnote in [T] claims that the problem was solved by Rabin. 
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? FO+TC 1 
FO+posTC 1 = NPW^ 



? MSO = REG 
^ = AW : A 




FO+DTC 1 = DPW^Aj \[8] 

NW X A 



lfo m 



N, or A in x), and may use nested pebbles in case X contains P. Lines without question 
mark denote proper inclusion, those with question mark just inclusion. 

The inclusion LFO C DW^ was shown in [19], as well as FO C DP\N 1 A. The reg- 
ular language (aa)* cannot be defined in first-order logic, and shows that DW 1 A ^ FO. 
The strictness of DW^ C NW^ was shown in [7]; their example additionally shows that 
FO ^ DW^. The result of [8j shows even that FO % N\N 1 A, cf. Example ED Logi- 
cal characterizations of DW^^A and NW 1 A are given in [48], also using transitive closure 
(but with an additional predicate indicating the level of a node modulo some constant). 
All families considered here are contained in the family REG of regular tree languages 
that can be characterized by monadic second-order logic MSO [15, 62J. The inclusions 
FO+DTC 1 C FO+posTC 1 C FO+TC 1 C MSO are obvious. In [52] several logics for regular 
tree languages are studied; it is stated as an open problem whether all regular tree languages 
can be defined using monadic transitive closure, i.e., whether FO+TC 1 = MSO. 

Alternating tree- walking automata are considered in [60] . Alternation combines nonde- 
terminism (requiring a successful continuation from a given state) with its dual (requiring 
all continuations to be successful). It is not difficult to see that a (nondeterministic) top- 
down tree automaton can be simulated by an alternating tree- walking automaton, but the 
reverse inclusion is nontrivial: REG = AW 1 A. 

If, instead of with pebbles, single-head tree-walking automata are equipped with a syn- 
chronized pushdown or, equivalently, with 'marbles', then they do recognize all regular tree 
languages |36U23l[2~T] . both in the deterministic and nondeterministic case. Synchronization 
means that the automaton can push or pop one symbol when it moves from a parent to a 
child or vice versa, respectively. 

Questions. Several of the inclusions between the families of trees we have studied are not 
known to be strict, cf. the figure in this section. These are all left as open problems (but 
see below). So, for logics, are the inclusions FO+DTC 1 C FO+posTC 1 C FO+TC 1 C MSO 
strict? For tree-walking automata, are the inclusions DPW^ C NPW^ C REG strict, 
is NW 1 A C DPW 1 A? Considering the use of pebbles, is there a strict hierarchy for tree 
languages accepted by (deterministic) tree-walking automata in the number of pebbles 
these automata use? 
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The pebbles we have used in this paper are considered as pointers, and differ from the 
pebbles that can only be picked up at the node they are dropped. We conjecture that our 
type of pebbles is more powerful than the usual one. For nonnested pebbles the two types 
have the same power, even when the number of pebbles is fixed (as shown in Theorem 2.2 



Alternating tree-walking automata with classic (nested) pebbles were considered in 
[43j . where it is shown that these automata accept the regular tree languages. We did 
not investigate whether alternation together with pointer pebbles again yields the regular 
languages, i.e., whether APW 1 A = REG. 

Answers. Since the appearance of the report version of this paper, several of the questions 
above have been answered. 



In the recent [9] it is shown that, surprisingly, the two types of nested pebbles have 
the same power. Moreover, it is shown that the number of pebbles gives rise to a strict 
hierarchy, both for deterministic and nondeterministic automata, and that the inclusion 
NPW^ c REG is strict. 

In the very recent paper [ifi] it is shown that APW X A = REG holds, using an easy 
variation of the proof technique of [33] . 

Both tree-walking automata and logical formulas can also be used in a natural way to 
define binary relations on the nodes of trees, called trips in |21| . It is straightforward to show 
that Theorem 15.31 and Corollary 15.61 also hold for trips. XPath-like formalisms (relevant to 
XML) that are equivalent with, or closely related to, the trips defined in FO+posTC 1 and 
FO+TC 1 have recently been studied in [27lfl3] . 

Single head on strings. For strings the equivalence MSO = REG between monadic 
second-order logic and finite-state automata was obtained independently by Biichi, Elgot, 
and Trakhtenbrot [TT1 [TTJ [63J ■ As observed, tree- walking automata on monadic trees cor- 
respond to two-way finite-state automata on strings. Since the latter are equivalent to or- 
dinary finite-state automata (both deterministic and nondeterministic) |53l I58j most of the 
hierarchy is known to collapse in the string case: DW 1 A = REG. Hence for k = 1 our main 
result gives two additional characterizations of the regular string languages REG = NPW 1 A 
and REG = FO+DTC 1 . The second equality was shown in [2] (see also Exercise 8.6.3 of 



We generalize our results on trees (and strings) to more general families of graphs. 
There is a basic problem that has to be resolved. We have to propose a model of graphs 
that is suitable for both graph- walking automata and logic. The structures for which Im- 
merman has obtained his logical characterization of DSPACE(logn) are ordered, i.e., they 
are equipped with a total order <, which can either be specified directly or, as transitive 
closure is available in the logical language, by a direct successor relation. In fact, when 
placing a structure on the tape of a Turing Machine, as has to be done for DSPACE(logn), 
it almost automatically obtains an implicit order on its elements. A natural way to present 
a structure (assuming only relations of at most arity two) as input to a graph-walking au- 
tomaton is to represent all pairs in a binary relation as directed edges in a graph, labelled 
by a symbol representing the relation; unary relations and constants are translated as labels 
for the nodes. If we fully represent a total order < in this way, there will be many edges 



of 01)- 



PI], and Satz 2.0.1 of [52]). 
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with the same label leaving (and entering) each node. Although the edge to the direct 
successor can be directly specified in first-order logic we see no natural machine model that 
can choose this edge among the many candidates. 

A solution would be to assume the existence of direct successor edges in the graph, i.e., 
to consider 'ordered' graphs (cf. the end of the Introduction), but we feel this requirement 
is too restrictive. On trees, tree-walking automata use the preorder to traverse the nodes 
of the input. Thus, the order is implicit in the tree, and not explicitly given by edges with 
a special label. Below we generalize this idea to graphs: we do not assume an explicit total 
order on the nodes, but we postulate the existence of a special graph-walking automaton, 
the guide, that can be used as a 'subroutine' to traverse the nodes of any graph in the family 
of graphs we are considering. For trees the guide follows the preorder, for (two-dimensional) 
grids the guide visits the nodes row by row (with generalizations to higher dimensions). 

Graphs. We consider (families of) nonempty finite directed graphs, where nodes and edges 
are labelled using a common alphabet E. In order to be able to locally distinguish edges 
we require that no node has two outgoing edges or two incoming edges with the same label. 
Hence the graphs we consider are of bounded degree. Parallel edges and loops are allowed. 
Thus, each edge label represents a (partial) injective function on the set of nodes. We 
consider (weakly) connected graphs only. 

Trees over a ranked alphabet fall under this definition since we label the edge from a 
parent to its i-th child by i. Another example of such a family is formed by grids, where 
two different labels are used to distinguish edges to horizontal and vertical neighbours. (In 
our model there is no need to introduce special node labels that mark the boundaries of the 
grid, cf. [HE], as the labels of the edges incident to a node are available to our automata). 

Automata. A /c-head graph-walking automaton with nested pebbles is like its relative for 
trees in that it visits the nodes of a graph, each of its heads walking along the edges from 
node to node. It may check the labels of its current nodes, check whether such a node 
has an incident incoming/outgoing edge with a specific label (generalizing the concept of 
child number and the rankedness of E, respectively), check for the presence of a specific 
pebble (from a finite set), and it may then move each head along any edge in either direction 
choosing the edge based on its label. Moreover, it may drop and retrieve pebbles in a nested 
fashion, as before. 

Formally we change the operations and tests available to the automaton. The moves 
upj and downy are replaced by the more symmetric pair inmovej iCT , outmovej jCr to specify a 
move by the i-th head along an incoming (outgoing) edge with label a. Likewise we replace 
the child-number test clmoy by the tests inedge^ a and outedge^ a on the existence of an 
incoming (outgoing) edge with label a for the node currently under the i-th head. 

Generally graphs do not have a distinguished node (like the root is a distinguished node 
for trees) and we change the definition of acceptance accordingly. A graph-walking automa- 
ton with nested pebbles accepts a given input graph if the automaton has an accepting 
computation (starting in the initial state, halting in an accepting state) when started with 
all its heads on any node of the input graph; initially and finally there should be no pebbles 
on the graph. We require that the existence or nonexistence of an accepting computation is 
independent of the chosen initial node, as this seems a natural condition, especially in the 
context of determinism. Note that not all automata satisfy this requirement |j 

^Formally, we define L(A) to consist of all graphs g over E such that acc(u) for every node u of g, where 
acc(«) means that [qo,u k ,e] \~A,g [<?i^; e ] f° r some halting configuration [q,v, e] with q G A and v is a fc-tuple 
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For trees over a ranked alphabet this definition is obviously equivalent to the old one as 
a tree-walking automaton can always move all its heads to the root. Disregarding pebbles, 
for grids the definition is equivalent to the fc-head automaton of [2J (and for k = 1 to the 
2-dimensional automaton of 

In order to avoid an abundance of new notation, we keep the notation DPW fc A, etc., 
for families accepted by (deterministic) A:-head automata (with nested pebbles) even in the 
more general context of graphs. 

Logic. The first-order logic for graphs over the label alphabet E has atomic formulas 
lab a (x), a 6 E, for a node x with label a, edg a (x,y), a 6 E, for an edge from x to y with 
label a, and x = y. 

As for automata, we keep the notation for families defined by our first-order logic with 
several variants of transitive closure. Note that we do not allow the predicate x < y, which 
makes the logic more like the local variant of first-order logic. As we have seen in Section [2j 
for trees this predicate is definable in first-order logic with (positive deterministic) transitive 
closure, and the families FO+DTC fc , etc., of tree languages definable in first-order logic with 
transitive closure, do not change by this restriction. 

For arbitrary families of graphs the computation of a graph-walking automaton with 
nested pebbles can be specified in first-order logic with transitive closure, like in Section [5j 

Lemma 7.1. For every family of graphs, andk>\, DPW fe A C FO+DTC fc . 

Proof. In the proof of Lemma 15.21 where we consider trees instead of graphs, no reference 
is made to properties particular to that domain. The only place where the proof given has 
to be adapted is where the accepting condition is phrased in the logic. Here any node must 
have an accepting computation when we start the automaton with all heads on that node, 
so we write (\/x)(3y) V g gA ^pl^{ xk ^ v) where p is the initial state, the disjunction is taken 
over the set A of accepting states, and x k is the /c-tuple consisting of k copies of x. Note 
that we might as well take 3x instead of Vx. □ 

The reverse inclusion needs an additional notion, the ability to search each graph in a given 
family of graphs. 

Searchable Graphs. A family of graphs is searchable if there exists a (fixed) single- 
head deterministic graph- walking automaton with nested pebbles that, for each graph in 
the family, and each node of the graph, when started in that node in the initial state the 
automaton halts after completing a walk along the graph during which each node is visited 
at least once. Pebbles may be used in a nested fashion during the walk, as before. Thus 
the automaton serves as a guide for the family of graphs, and makes it possible to establish 
and traverse a total order of the nodes of the graph, generalizing the concept of preorder 
traversal used for trees. (In fact, the guide visits the same node perhaps several times, like 
in a preorder traversal on trees, but we will see that the unique first visit to a node can be 
recognized.) Note that the total order may depend on the node at which the guide starts 
its walk. Note also that if a family of graphs is searchable, then any subset of the family is 
searchable (by the same guide). 

of nodes of g (and u k is the fc-tuple of k copies of u). Alternatively, we could require this for some node u 
of g, because, as discussed above, A is restricted to satisfy the requirement that for all graphs g over E and 
all nodes ui,U2 of g, if acc(ui) then acc(«2). We note, however, that this restriction is not essential for our 
results: it is easily shown that for searchable graphs (see below) the restriction can be dropped. 
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We give several elementary examples of this notion. Trees over a ranked alphabet form 
a searchable family: from any node we may walk to the root to start a preorder traversal 
through the tree. Similarly binary trees -which are basically ranked trees where every node 
has rank two, but where either child of a node can be missing- are a searchable graph 
family. 

Unranked (ordered) trees, where there is no bound on the number of children of a 
node, are important in the theory of data representation using XML [47 \ \37 \ [57] . Here they 
are considered in their natural encoding as binary trees, where each node carries possibly 
two pointers (edges), one to its first child, one to its right sibling. In this way they are a 
searchable graph family. The (single-head) automata we obtain using this representation 
may move to the first child or to the next sibling of a node (and back), exactly as customary 
in the literature \47\ 150] (albeit without pebbles). For an overview on logics for unranked 
trees, see [39J. 

Rectangular (directed) grids, edges pointing to the right or downwards, with edge labels 
distinguishing these two types of edges, form another example. This can be generalized to 
higher dimensional grida^ [2]. 

All the above are superseded by acyclic, connected graphs with a single source (and the 
local restriction on edge labels). We can effectively turn those graphs into trees by placing 
an ordering on the labels and ignoring all incoming edges of a node except the edge which 
has minimal label among those edges. Graph-walking automata (with one head and no 
pebbles) on such graphs were considered in [36] . 

Obviously, a family of graphs such that, for some S'CS, the graphs induced by the 
S'-labelled edges belong to one of the above families, is searchable too. This includes the 
family of all 'ordered' graphs. 

The above families are traversed in a rather standard way. In the next example we use 
a pebble to find and 'break' a cycle. 

Example 7.2. Consider an ordinary binary tree, and take one of its leaves. If we identify 
this leaf with the root (i.e., we remove it and redirect the incoming edge to the root) we 
obtain a single directed cycle from which trees radiate, like charms hanging from a bracelet. 

We argue that the family of these graphs is searchable. If started on a node in the cycle 
a graph-walking automaton may drop a pebble at that node. This effectively reduces the 
graph to a tree with the marked node as root, and a preorder traversal can be made. 

We claim that a node on the cycle can be found, with the help of a single pebble, 
starting on an arbitrary node of the graph. Put the pebble on the initial node. Search the 
graph 'below' the pebble as if it is a tree; if indeed it is a tree, then we finally return to 
the pebble from below. Otherwise we are on the cycle, and we enter the leaf that has been 
identified with the root, and will find the pebble from above. Thus, in order to find a node 
on the cycle we repeat the above search, each time moving the pebble one node up while 
not on the cycle. □ 

Cyclic grids, or toruses, where the last node of each row has an edge to the first node of 
that row, and similarly for columns, can be searched using two pebbles. We search the grid 
row-by-row: the first pebble marks the position we start with (in order to stop when all 
rows are visited; we do not move this pebble during the traversal), the second pebble moves 
down in the first column to mark the position in which we started the row (in order to stop 

d-dimensional grid has nodes (xi, . . . , Xd) with Xi £ N, 1 < Xi < ki, for certain fci, . . . , kd- For each i 
it has i-labelled edges from (xi, . . . , Xi, . . . , Xd) to {x%, . . . , Xi + 1, . . . , xa), provided Xi < ki. 
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when we finish the row; we then move the pebble down to the next row until we meet the 
first pebble). This can be generalized to d pebbles in the d-dimensional caseJl 

It is an open question whether mazes (modelled as connected subgraphs of grids) form a 
searchable family of graphs, that is, with a single head and the help of nested pebbles. They 
cannot be searched without the help of pebbles, or with only a single (classic) pebble [T2|l31]. 
According to [5] these graphs can be searched by single-head graph-walking automata with 
two (classic) pebbles which are unfortunately not used in a nested fashion, or using two 
heads (without pebbles). 

The family of all graphs (over a given alphabet with at least two elements) is not 
searchable, not even with nonnested pebbles or with several heads. This follows from 
results of Cook and Rackoff [14J. The deterministic /c-head graph- walking automaton with 
n pebbles is a special case of the jumping automaton of p3] with k + n pebbles. A jumping 
automaton may move its pebbles along the edges of the graph, or move one of its pebbles 
to the location of another pebble ('jumping'). Retrievability of pebbles at a distance is a 
particular case of this jumping facility. Basically, Theorem 4.9] states that the number 
of pebbles needed to visit all nodes of a d-dimensional torus grows as a function of d. Then 
[141 Theorem 4.13] concludes there is no jumping automaton that searches all graphs over 
a three letter alphabet, by coding arbitrary graph alphabets (like the d dimension edge 
labels for toruses) into a three letter alphabet. A slight adaptation of the proof (because of 
the graph model used) makes it valid in our setting, with two letters instead of three. For 
classic pebbles (and a single head) it is shown in [55] that the family of all planar graphs is 
not searchable. 

For an overview of approaches to the exploration of mazes and graphs by finite automata 
see plE]. 

For searchable graphs one can obtain the converse inclusion from our previous result, 
and translate logic into automata. 

Theorem 7.3. For every searchable family of graphs, and k > 1, DPW fc A = FO+DTC fc . 

Proof. By the previous lemma it suffices to show FO+DTC fc C DPW fe A for searchable 
graphs, i.e., to reconsider the proof of Lemma l4.ll There is no need to return to a 'root' (we 
do not have one) to test acceptance by some automaton (the induction hypothesis used as 
a 'subroutine'); in the definition of acceptance we have required acceptance for any initial 
node, so we merely move all heads to the same position: drop a pebble, let each head search 
for it, and lift the pebble 

At several points in that proof the automaton is required to make a systematic traversal 
through the input tree, or more specifically, to find the preorder successor of a given node, 
a function easily implemented for trees. At some other points we are supposed to move a 
head to a specific position marked by a pebble: the pebble again is found by a systematic 
traversal of the tree. 

Here we instead use the 'guide', the automaton that makes the family searchable, as a 
subroutine to determine the successor of a node (and to make traversals of the graph). We 
have to mind some details, however. First, the guide may use its own pebbles during its 
computation, as is required to traverse a torus; leaving these pebbles on the graph would 

d-dimensional torus is a d-dimensional grid with additional i-labelled edges from (x\, . . . , ki, . . . , Xd) 
to (an, . . . , 1, . . . , x d ). 

^Dropping a temporary pebble is also a technique to show we may assume heads to be sensing, i.e., the 
automaton can check whether two heads occupy the same position. 
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block the removal of the other pebbles on the graph. Second, during the traversal of the 
graph the guide may visit the same node several times. In order to define the notion of 
successor properly, we have to distinguish a particular visit; for this we take the first visit 
to the node (as in the preorder of the nodes of a tree). Third, the order of visiting the nodes 
may depend on the node at which the guide is started, which makes the notion of successor 
undefined (think torus again). 

Thus, we need to explain how the unique successor can be found for each node in the 
graph (except the last). First, at the very beginning of its computation, the automaton 
we construct for a given formula drops a pebble on the node where it is started (i.e., the 
node where all its heads are initially positioned; this may be any node of the graph). This 
pebble, let us call it 'the origin', remains at its position during the full computation to 
serve as a fixed position in the graph. Every successor will be determined in the order when 
starting the guide in the origin. Now the task of finding the successor of a (marked) node 
using a single head is executed as follows. First we use the guide (from wherever we are) 
to move the head to the origin. When the origin is reached, we 'reset' the guide (removing 
its pebbles from the graph, returning to its initial state) and again start a traversal of the 
graph looking for our marked node. When we reach the marked node, we continue running 
the guide until we reach a node that is visited for the first time: our (well-defined) successor. 
In order to check whether a node is visited for the first time we leave all pebbles of the 
guide at their position and drop a new pebble on the node. We then start a copy of the 
guide at the origin and run it until we find the new pebble. If the copy of the guide is in the 
same state and has the same positions for its pebbles as the original guide, then it visits the 
node for the first time: due to its determinism the guide cannot visit a node twice in the 
same configuration. After this test we stop the copy of the guide and retrieve its pebbles. 
We either have found the successor node, retrieve the pebbles of the guide and proceed as 
needed, or we lift the new pebble and continue the guide. If the guide halts without finding 
a first visit, the marked node was the last node in the order. □ 

As in Corollary 15.61 we have in the nondeterministic case NPW fe A = FO+posTC fc for every 
searchable family of graphs, and k > 1. 

Multi-Searchable Graphs. It is open whether we can search a maze (a connected sub- 
graph of a grid) with a single head using nested pebbles. However with two heads we can 
search a maze [5] . To cover this family we need to extend the notion of searchability: as the 
automata we are dealing with have k heads, it seems only fair to extend the guide with this 
commodity. A family of graphs is k-searchable if there is a deterministic guide as before, 
which now may have k heads. The guide starts its computation with all its heads on an 
arbitrary node of the graph, and performs a traversal of the graph. Recall that the formal 
model assumes that at most one head moves in each computation step of the guide; this 
ensures that at most one node is visited for the first time, thus uniquely defining the order 
of the nodes. 

With this notion the results of the previous section can be extended to the larger class 
of fe-searchable graphs. But there is a catch: we have to extend our automaton model with 
a new instruction (p, jumpj(x), q) that moves a given head i to the position of a given pebble 
x (like the 'jumping' instruction from [14j). In the case of searchable graphs a single head 
can be moved to any pebble, just by running the guide using that head, searching for the 
pebble. For k > 1 however, in order to move a single head to a pebble, we need several 
heads, either introducing auxiliary heads (loosing the connection between number of heads 
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and arity of transitive closure) or loosing the position originally held by the heads needed 
for the search. (Even ignoring the problem of how to move all heads to the same initial 
position as needed for the guide.) 

The suggested additional jump-instruction is rather natural if we think of pebbles as 
pointers as we did before. If the graph-walking automaton stores a finite number of 'ad- 
dresses' of nodes, it can use a simple assignment to move a head to such a position. Adding 
this instruction is 'backward compatible': it will not change any of the previous results. On 
families of searchable graphs (which includes trees) the new instruction can be implemented 
as explained above. 

Note that if a family is fe'-searchable, then it is A;-searchable for k > k' . Thus, the next 
result generalizes Theorem 17.31 (which for trees is Theorem 15 . 3|) . 

Theorem 7.4. Let the automaton model be extended with the jump-instructions, as dis- 
cussed above. For every k-searchable family of graphs, k > 1, DPW fc A = FO+DTC fc . 

Proof. For the inclusion DPW fe A C FO+DTC fc we note that the new instruction (move head 
i to pebble x) is directly expressible in the logic as we assume variables for the positions of 
heads and pebbles, cf. the proof of Lemma 15.21 

For the converse inclusion FO+DTC fc C DPW fc A we once more carefully inspect the 
proof of Lemma 14.11 or rather the proof of Theorem 17.31 

As observed in the latter proof, the guide is used for two purposes: (1) to find the 
successor of a node, and (2) to move a specific head to a specific pebble (and note that 
(2) is also used in (1): a kind of bootstrapping). For (2) we now use the new instruction, 
whereas for (1) we use the fe-head guide. Note that to initiate the guide, all heads can be 
moved to the origin by the new instruction. Note also that before initiating the copy of the 
guide, pebbles should be dropped on all current head positions of the guide (to be able to 
compare them with those of the copy of the guide, and to restore them in case the successor 
has not been found). □ 

Since mazes are 2-searchable, we obtain that for mazes and k > 2, DPW^A = FO+DTC fc . 

Again we obtain a similar result for the nondeterministic case: NPW fc A = FO+posDTC fc 
for every fc-searchable family of graphs. 

As discussed before Theorem 17.31 our A:-head graph-walking automaton with nested 
pebbles is a special case of the jumping automaton of [14J; this includes the 'jumping' 
instruction of moving a head to the position of a pebble. Thus, by the results of |14j . the 
family of all graphs (over £ D {0, 1}) is not A:-searchable for any k. 

According to Theorem 17.41 the existence of a A:-head guide for a family of graphs 
entails that every language in FO+DTC fc can be implemented by a A;- head graph- walking 
automaton with nested pebbles. This is obvious for the language Lq defined by the formula 
(Vx) labo(x): it can be checked directly by the guide whether every node visited has label 
0. It is easy to see that this also works the other way around: assuming X D {0, 1}, an 
automaton for Lq must visit all nodes of any input graph that has only node label 0, since 
otherwise it also accepts that graph with the labels of unvisited nodes changed into 1. The 
automaton can be turned into a guide for the family by making it behave as if every node 
label equals 0. Here we require that the family is node-label-insensitive: membership of a 
graph in the family does not depend on the label of any node of the graph. For instance, 
the families of unranked trees, grids, and toruses we considered are node-label-insensitive. 
Thus, a node-label-insensitive family of graphs is /c-searchable iff Lq G DPW fc A. Hence for 
such a family Theorem 17.41 also holds in the other direction: if J 7 is a node-label-insensitive 
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family of graphs, then for every k, T is fc-searchable iff DPW fe A = FO+DTC fc . Moreover, 
T is fc-searchable for some k iff DPWA = FO+DTC, where DPWA = \J keN DP\N k A, and 
FO+DTC = UfceN FO+DTC fc . Moreover, this holds iff [J keN DW fc A = FO+DTC, assuming 
that the /c-head graph-walking automaton (without pebbles) has an additional instruction 
to move one head to another (i.e., it is the jumping automaton of |14j). 

Applying this to the family of all graphs, which is clearly node-label-insensitive, we 
note that the inclusions DPW fc A C FO+DTC fe (cf. Lemma EU and DPWA C FO+DTC are 
strict, the language Lq being in the difference. 

Pebbles left unturned. An obvious question that remains open is whether there exists 
a strict hierarchy in DSPACE(logn) with respect to the number of heads used (allowing 
nested pebbles), or equivalently with respect to the arity of transitive closure. The same 
question can be asked for trees and grids. For arbitrary graphs, strictness of the hierarchies 
FO+TC fc and FO+DTC fc is shown in [29]. Some other unresolved questions were stated in 
Section [6l regarding single-head automata on trees. The question concerning the nature 
of our nested pebbles, whether the pointer model is more powerful than the classic model, 
remains interesting for multi-head automata. Another question is whether our results can 
be generalized to alternating automata and the alternating transitive closure operator of 
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